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How natural selection can promote cooperative or altruistic behavior is a fundamental question in biological 
and social sciences. One of the persuasive mechanisms is "indirect reciprocity," working through reputation: 
cooperative behavior can prevail because the behavior builds the donor's good reputation and then s/he 
receives some reciprocal benefits from someone else in the community. However, an important piece missed 
in the previous studies is that the reputation-building process requires substantial cognitive abilities such as 
communication skills, potentially causing a loss of biological fitness. Here, by mathematical analyses and 
individual-based computer simulations, we show that natural selection never favors indirect reciprocal 
cooperation in the presence of the cost of reputation building, regardless of the cost-to-benefit ratio of 
cooperation or moral assessment rules (social norms). Our results highlight the importance of considering 
the cost of high-level cognitive abilities in studies of the evolution of humans' and animals' social behavior. 

Cooperative interactions among genetically unrelated individuals are a fundamental aspect of human 
society. However, cooperation accrues a cost, c, to the donor of the cooperation while conferring a benefit, 
b, to another individual (b > c > 0). What mechanism enables the evolution of this costly behavior, i.e., 
cooperation? This issue has been of considerable concern in both social and biological sciences 16 . 

One of the proposed mechanisms for the evolution of cooperation is "indirect reciprocity" working through 
reputation 711 . That is, cooperative behavior can prevail because the behavior builds the donor's good reputation 
and then he or she receives some reciprocal benefits from someone else in the community. For example, if an 
individual A helps B, another individual C observes the cooperation; C builds and spreads A's good reputation, 
and then another individual D helps A by referring to A's reputation. This account is quite powerful because this 
does not require kinship, a spatial (group)-structure, or repeated interactions among individuals to explain the 
emergence of cooperation. 

It is important to note that the indirect reciprocity account relies on individuals' substantial cognitive abilities 
for reputation processing (e.g., communication skills, capacity for judgment based on social norms). High-level 
intelligence has been thought to be very costly in terms of biological fitness 12,13 . In other words, natural selection 
does not favor the evolution of intelligence unless excessive benefit is present. Although the concept of indirect 
reciprocity is based on humans' high-level intelligence, which potentially causes a loss of biological fitness, to our 
knowledge no study has investigated the effect of the cost of reputation processing on the evolution of indirect 
reciprocity. 

In this study, we examine the evolution of indirect reciprocity in light of the cost of reputation building 
(spreading). By mathematical analyses and individual-based computer simulations, we demonstrate that the 
slight cost of reputation building completely destroys indirect reciprocal cooperation regardless of the cost-to- 
benefit ratio of cooperation or moral assessment rules (social norms). 

Results 

Let's consider a population comprising an infinite number of individuals. Each individual in the population has a 
reputation, either good or bad. For each round t = 1, 2... in a generation, each individual randomly finds an 
opponent and plays a one-shot prisoner's dilemma game. In this game, two individuals in a pair simultaneously 
choose to either "cooperate" or "defect". Cooperation confers a benefit, b, to the recipient while accruing a cost, c, 
to the donor (b > c > 0). In contrast, defection yields nothing to either person. Moreover, each individual's 
behavior in the game is witnessed by an observer who decides whether to build and spread the donor's reputation 
at a cost c R (> 0). Finally, at the end of the generation, each individual leaves offspring depending on his/her 
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fitness defined as the total payoff during the generation (i.e., natural 
selection). Higher fitness implies a higher probability that the indi- 
vidual can leave more offspring. 

How does an observer build the donor's reputation? In other 
words, what behavior of the donor is interpreted as good or bad 7 . 
We assume that an observer build the donor's reputation based on 
the donor's own reputation and behavior and the opponent's repu- 
tation 14 , and that all individuals in the population share the same 
moral assessment rule 9,14-16 (see Methods). Furthermore, each indi- 
vidual's reputation at the initial round is assumed to be good. 

In the present study, as in the previous studies 9,15,17 , we assume that 
each individual makes decisions in the prisoner's dilemma game 
based on the opponent's reputation. Such a decision-making rule, 
called a behavioral strategy, is denoted by a two-dimensional vector, 
P = (Pb>Pg)> where p B andp G e [0,1] indicate the probability that the 
individual cooperates when his/her opponent's reputation is bad and 
good, respectively. In addition, each individual in the role of the 
observer decides whether to build and spread the reputation of the 
donor. By q e [0,1], we denote the probability that the individual 
builds the donor's reputation. That is, each individual's strategy is 
defined as a pair of p and q, indicated by (p, q). 

Let G t (p, q) be the fraction of individuals among (p, q) strategists 
whose reputation is good at round t, and let x(p, q) be the probability 
density of (p, q) strategists over the population. Then, the average 
payoff of (p* q*) strategists at round t is: 



n,{p*,q*) = b 



pil-G t (p*,q*),G t (p*,q*))x(p,q)dpdq- 



(1) 



cp*-{\-G t ,G t )-c R q*, 



where G f = 



G ( (p,q)x(p,q)dpdq denotes the fraction of individuals 



whose reputation is good in the whole population. The first term of 
equation (1) represents the benefit of cooperation multiplied by the 
probability that the focal (p* q*) strategist is cooperated with by 
other individuals; the second term denotes the cost of cooperation 
multiplied by the probability that s/he cooperates, and the third term 
is the cost of reputation building multiplied by the probability that s/ 
he builds the donor's reputation when s/he is in a role of an observer. 

Since we can show that the reputation dynamics, G f (p* q*), 
depends on p* but not on q*, for any moral assessment rules 11,14 
(see Methods), G ( (p* q*) in equation (1) can be replaced by 
G t (p*), that is: 



n t {p*,q*) = b§pil-G t (p*),G t (P*))x(P,<l)dpdq- 
cp*-(l-G t ,G t )-c R q*, 



(2) 



Given that in equation (2) only the third term depends on q*, we can 
rewrite equation (2) as follows: 

K<(p*,q*) = J t {p*)-c R q*, (3) 

where Tt' ( (p*) indicates the payoff in the prisoner's dilemma game, 
which depends on p *but not q*, and c R q* denotes the cost of repu- 
tation building, which depends on q* but not p*. 

Equation (3) indicates that, given a value of p, individuals with q = 
0 always get the highest payoff in each round, regardless of the cost- 
to-benefit ratio of cooperation (c/fo), the amount of the cost of repu- 
tation building (c R > 0), or the moral assessment rules (update rule of 
Gf(p, q))- This means individuals with q = 0 always get the highest 
fitness, sum of the payoffs, for any p. In other words, natural selection 
always favors individuals who never build and spread a reputation 
(i.e., q = 0) for any behavioral strategies, p. Note that this statement is 
also true when we assume more general behavioral strategies, that is, 
when individuals make decisions based not only on the opponent's 
reputation but also on their own reputation (see Methods) 14,18 . In 
summary, we show that, in the presence of the cost of reputation 



building, natural selection results in a society where no individual 
builds a reputation and thus indirect reciprocity never works. 

To further explore the effect of the cost of reputation building in 
more complicated situations, we constructed an individual-based 
computer simulation model. Let us consider a population of n indi- 
viduals. As in the seminal work on indirect reciprocity 7 , each player 
has a reputation score, s, that ranges from —5 to 5. At the beginning 
of each generation, which comprises m x n consecutive rounds, 
scores are reset to zero. 

In each round, three individuals are chosen randomly from the 
population: one is a potential donor of help, another is a potential 
recipient of the help, and the third is an observer. A potential donor 
decides to either give help (cooperation) or to refuse help (defection). 
Cooperation confers a benefit, b, to the recipient while accruing a 
cost, c, to the donor (b > c > 0). Defection yields nothing to either 
person. Note that we assume that, with the probability s (0 < S <C 1), 
an individual who intends to cooperate fails to cooperate because of, 
for example, a lack of resources or a mistake, called an implementa- 
tion error 9,16,19 . After this, the observer decides whether or not to 
update and spread the donor's reputation score (i.e., reputation 
building) with a cost c R > 0. 

As in the analytical model, we assume all individuals in the popu- 
lation share the same moral assessment rule (social norm). We con- 
sider the following three rules. One is the simple, called SCORING 7 , 
in which the reputation score increases by one unit if a potential 
donor cooperates; it decreases by one unit if s/he defects. That is, 
cooperation and defection are judged as good and bad, respectively. 
Another rule is MILD (or sometimes called STANDING) 11 - 16 - 20 : the 
score increases if a donor defects against a bad individual (s < 0) or 
cooperates; it decreases otherwise. This rule incorporates the concept 
of justified defection 7,11 and has been known to stabilize the indirect 
reciprocal cooperation 9,16,18,20 . The third rule is STERM (or some- 
times called KANDO.R0 9,11,21 : the score increases if a donor coop- 
erates with a good individual (s >= 0) or defects against a bad 
individual; it decreases otherwise. This includes not only the concept 
of justified defection but also unjustified cooperation (i.e., coopera- 
tion with a bad individual is regarded as bad) 11 . 

A donor's behavioral strategy is given by a number, k (k = — 5 to 
6): an individual with this strategy, k, cooperates if the score of the 
recipient is at least k. In other words, a high reputation score of an 
individual often implies a higher probability that others will coop- 
erate with the individual. An observer's strategy is depicted by a 
number, q e{0,1}: an individual with q = 1 (0) does (does not) 
update and spread the donor's reputation score based on the moral 
assessment rule. That is, each individual's heritable traits are k and q. 

At the end of each generation, each individual leaves offspring 
depending on his/her fitness (i.e., natural selection). The fitness value 
is defined as the total payoff received during the generation 
(m X n rounds). Higher fitness implies a higher probability that 
the individual can leave offspring. We use the "binary tournament 
selection" procedure, a genetic algorithm, to select individuals 22 . In 
addition, mutation is introduced: with the small probability /.(, each 
individual's strategy k and q changes to another value randomly. 

Consistent with the previous studies 7,8 , we find that, in the absence 
of cost of reputation building, the frequency of cooperation increases 
with the decrease in the cost-to-benefit ratio of cooperation and the 
increase in the number of rounds in a generation (see the black points 
in Fig. 1). This tendency is common to all the three moral assessment 
rules (compare the panels A vs. B vs. C in Fig. 1). This indicates that 
the evolution of indirect reciprocal cooperation becomes easy as the 
net benefit of cooperation and the number of interactions among 
individuals increase. 

However, the presence of the cost of reputation building comple- 
tely destroys indirect reciprocal cooperation regardless of the cost- 
to-benefit ratio of cooperation or moral assessment rules (see the red 
points in Fig. 1). Even when the amount of cost of reputation building 
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Figure 1 | Effect of cost of reputation building, c R , on the evolution of cooperation. Black points indicate the case without the cost of reputation 
building; red represents the case with the cost (triangles: c R = 0.1; squares: c R = 0.01; these symbols overlap). The frequency of cooperation at the 1,000th 
generation is plotted as a function of cost of cooperation, c (benefit of cooperation, b, is fixed at 1; population size n = 200; probability of implementation 
error e = 0.05; mutation rate \i = 0.01; an individual's strategy at the first generation is determined randomly). Each point denotes the values averaged 
over 200 computer simulation runs. (A) Individuals use the moral assessment rule, SCORING. Top row: the average number of rounds for each individual 
in a generation, m, is 3; Middle row: m = 5; Bottom row: m = 7. (B) MILD. (C) STERM. 



is only 1% of the benefit of cooperation, the evolution of cooperation 
is impossible (red squares in Fig. 1). We also examined another case 
in which the reputation of a donor is built by the recipient instead of a 
third person/observer. That is, the recipient decides to or not to incur 
a cost of building the reputation of the donor. We confirm that, 
consistent with the original case, introducing the cost of reputation 
building makes it the evolution of indirect reciprocal cooperation 
impossible (see Suppl. Fig. SI). 

We also investigated a situation in which an observer who does not 
build the donor's reputation can lose his/her own good reputation. In 
other words, individuals who do not incur a cost of reputation build- 
ing can be judged as bad. We assume that, at each round, in addition 
to a donor, a recipient and an observer, one individual is selected as 
an observer of the observer, who we call a "second-order observer." 
S/he decides whether or not to update and spread the observer's 
reputation score with a cost Cr. > 0. A second-order observer's strat- 
egy is depicted by a number, r e{0,l}: an individual with r = 1 (0) 
does (does not) build the observer's reputation. When a second- 
order observer's strategy is r = 1, the (first-order) observer's repu- 
tation score increases by one unit if s/he builds the donor's repu- 
tation; it decreases by one unit otherwise. The results of the computer 
simulation show that, consistent with the original case, natural selec- 
tion never leads to indirectly reciprocal cooperation regardless of the 
cost-to-benefit ratio of cooperation or moral assessment rules (see 
the red points in Suppl. Fig. S2), except for the unrealistic case with- 
out the cost of reputation building for second-order observers (i.e., 
c R - = 0; see the black points in Suppl. Fig. S2). This is because 
individuals building a reputation as a second-order observer (i.e., 
r = 1) are exploited by individuals with r = 0 for the cost, c R ' (see 
Suppl. Fig. S3). 



These results together demonstrate that, as predicted by the ana- 
lytical models, the evolution of cooperation based on indirect reci- 
procity is extremely vulnerable to the cost of reputation building in a 
complicated situation (e.g., a reputation score is 11 -scale but not 
binary 23 ). 

Discussion 

We have examined the effect of the cost of reputation building on the 
evolution of cooperation through indirect reciprocity. By analytical 
investigation and individual-based computer simulations, we have 
shown that the slight cost of reputation building completely destroys 
indirect reciprocal cooperation regardless of the cost-to-benefit ratio 
of cooperation or moral assessment rules. 

In a broad sense, our results are closely related to the second-order 
free rider problem 24,25 , which asks "who incurs a cost to preserve 
social systems for the maintenance of cooperation?". A typical 
example is the evolution of altruistic punishment, which asks 26 " 28 
"who bears a cost of the punishment of defectors?". Imagine that 
there are three types of individuals: free riders (defectors), altruistic 
punishers (cooperators who punish free riders) and cooperators who 
do not punish, and that there is a cost to administering punishment. 
In this situation, although altruistic punishers punish free riders with 
a cost, cooperators never bear the cost. In other words, cooperators 
can "take a free ride" on punishers' punishment of free riders, the so- 
called second-order free ride. Natural selection therefore favors 
cooperators and leads to the extinction of altruistic punishers, and 
then, after the extinction of the punishers, defectors finally dominate 
the population of cooperators. 

In a context of indirect reciprocity, it has been demonstrated that 
cooperation can evolve through indirect reciprocity given an appro- 
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priate reputation system 7,11 . However, little attention has been paid to 
the issue of who covers the costs of maintaining the reputation sys- 
tem. As shown in the present study, in the presence of the cost, the 
reputation system is no longer sustainable and thus indirect recip- 
rocal cooperation vanishes. The present results shed light on the 
possibility that the second-order free rider problem is inherent in 
various contexts other than altruistic punishment. 

One caveat to our conclusion is that other mechanisms may sup- 
port the evolution of cognitive abilities to build reputations and thus 
indirect reciprocity. One candidate is multilevel (group) selection 29 " 31 . 
Suppose that individuals of a population are subdivided into groups 
and interact within these groups, and that individuals who build 
reputations are concentrated together within the same groups. 
Then, groups of individuals who build reputations can achieve 
high-level indirectly reciprocal cooperation and thus, despite the cost 
of reputation building, outperform the other groups of individuals 
who do not build reputations. Consistent with this conjecture, it has 
been demonstrated that, in the absence of costs of reputation build- 
ing, multilevel selection supports the evolution of a moral assessment 
rule under which indirectly reciprocal cooperation is evolutionarily 
stable 21 . 

Another candidate is network reciprocity 5,3233 . In a typical setting 
considered in studies of network reciprocity, individuals of a popu- 
lation occupy the vertices of a graph and the edges determine who 
interacts with whom. That is, each individual plays a game only with 
her neighbors and mimics the most successful neighbor's strategy. In 
this setting, individuals who build reputations can form a cluster in 
which indirect reciprocity is established, and thus might be able to 
prevail in the population. 

Moreover, it might be possible that building reputations is bene- 
ficial enough to overcome the cost in situations that differ from those 
considered in our study. Indeed it has been mathematically shown 
that, when individuals engage in several games, an evolutionary out- 
come of a single game cannot be predicted without assessing the 
structures of the other games 34 . How these mechanisms complement 
indirect reciprocity in the presence of the cost of reputation building 
will require further investigation. 

Further, it is worth noting that indirectly reciprocal cooperation 
does not necessarily rely only on reputation building. For example, if 
the population is small enough, each individual can judge others' 
goodness by direct observation, instead of the indirect observation 
via reputation. Moreover, experimental studies on humans and rats 
demonstrated another type of indirect reciprocity: an individual is 
more likely to help anonymous others when s/he received help in 
previous interactions, compared with when s/he did not 35 37 . This 
type of reciprocity, called upstream indirect reciprocity, seems to be 
driven by a feeling of gratitude rather than reputation building. 
However, to date, few studies have provided theoretical explanations 
of the evolutionary origin of upstream indirect reciprocity 38-40 . 

Humans' sociality is supported by their sophisticated cognitive 
abilities, e.g., language skills, which are biologically costly 12,13 . 
Despite the importance of the cost of intelligence in human evolu- 
tion, little is known about the effects of this cost on the evolution of 
cooperation (but see references 41,42 ). This study is, to our knowledge, 
the first to assess the relationship between the cost of high intel- 
ligence and the evolution of indirect reciprocal cooperation, high- 
lighting the importance of considering the costs of high-level 
cognitive abilities in studies of the evolution of humans' and animals' 
social behavior. 

Methods 

Reputation dynamics in the basic analytical model. Here we show that the 
dynamics of the fraction of individuals among (p, q) strategists who have a good 
reputation, G t (p, q), depends on p, but not on q. 

We can consider various ways to update reputations, called moral assessment 
rules 7,11,14,18 . One way is that cooperation (defection) is simply regarded as good (bad) 
irrespective of the donor's or the opponent's reputation. Another way is to introduce 



the concept of justified defection: defection against a bad opponent is judged as good. 
Moreover, one might judge cooperation with a bad opponent as bad, which is the 
concept of unjustified cooperation. Taking these possibilities together, we assume that 
individuals in a community share the same moral assessment rule and they judge a 
donor's goodness based on the donor's own reputation and behavior and the oppo- 
nent's reputation. 

A moral assessment rule is then represented by an eight-dimensional vector, R = 
(Rbdb, Rbdg, Rbcb, Rbcg, Rgdb, Rgdg, Rgcb, Rgcg)- The subscript letters indicate, 
from the left to the right, the donor's reputation, the donor's behavior and the 
opponent's reputation, respectively, and R e [0, 1] denotes the probability that the 
donor's behavior is judged as good in each of the eight situations. For example, 
(0,0,1,1,0,0,1,1) indicates IMAGE SCORING 43 in which cooperation (defection) is 
simply regarded as good (bad). Further, R*db — 1 is consistent with the concept of 
justified defection, and R*cb ~ 0 reflects the concept of unjustified cooperation (* is a 
wild card). 

At round f, consider the probabilities that: (l)abad(p,q) strategist defects against a 
bad opponent, (1 — G t (p, q)) (1 — p B ) (1 — G ( ); (2) a bad (p, q) strategist defects 
against a good opponent, (1 — G t (p, q)) (1 — Pg) G t ; (3) a bad (p, q) strategist 
cooperates with a bad opponent, (1 — G t (p, q))pB (1 — G f ); (4) a bad (p, q) strategist 
cooperates with a good opponent, (1 — G t (p, q)) pc G t \ (5) a good (p, q) strategist 
defects against a bad opponent, G t (p, q) (1 — p B ) (1 — G ( ); (6) agood (p, q) strategist 
defects against agood opponent, G t (p, q) (1 — p G ) G t ; (7) agood (p, q) strategist 
cooperates with a bad opponent, G t (p, q) p B ( 1 — G f ); and (8) a good (p, q) strategist 
cooperates with a good opponent, G t (p, q) pG G t . These probabilities are represented 
by a vector, 5. Since each component of 5 is a function of G t (p, q) and p, we rewrite S as 
S[G,(p, q),p]. 

Recall that the reputation of an individual is updated only when the observer 
decides to build that individual's reputation, the probability of which is 

E(q) — Qqx(p,q)dpdq. Hence, the fraction of individuals among (p*, q*) strategists 

whose reputation is good at round t+ 1 can be written as: 

G l + l (p*,q*)=E(q)S[G,(p*,q*),p*]R T + (l-E(q))G t (p*,q*). (4) 

Given equation (4) and Gj(p, q) = 1 for any (p, q), we can see that G t + i(p*-> q*) 
depends on p* but not q* for any moral assessment rules. 

In conclusion, the fraction of individuals among (p, q) strategists who has a good 
reputation at round t is determined by p but not by q. 

Extended analytical model. Here we assume that each individual makes decisions 
based not only on the opponent's reputation but also on his/her own reputation. In 
this case, a behavioral strategy is denoted by a four- dimensional vector, p — (Pbb>Pbg> 
Pgb>Pgg)> where the left subscript letter denotes the focal individual's own reputation, 
the right denotes the opponent's reputation, and p represents the probability that s/he 
cooperates given his/her own reputation and that of the opponent. 

Here, the average payoff of (p* q*) strategists at round t can be calculated as: 

JJ (5) 
Q(p^)Gf(p*^*))^^)dp^-cp*-(B f (p* ; g*)B ( ,B ( (p*g*)G t) 

G t (p* y q*)B t ,Gt (p*,q*)G t ) - c R q*, 

where B t = 1 — G t denotes the fraction of individuals who have a bad reputation. In 
the same way as that in the basic model, we can show that the reputation dynamics 
Gt(p* q*) depend on p*but not on q*. Hence, we can show that, given a value of p, 
individuals with q — 0 always get the highest payoff in each round, regardless of the 
cost-to-benefit ratio of cooperation {c/b), the amount of the cost of reputation 
building {c R > 0), or moral assessment rules. In other words, natural selection always 
favors individuals who never build and spread a reputation, and thus indirect reci- 
procity never works. 
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